Exploration via State influence Modeling

نویسندگان

چکیده

This paper studies the challenging problem of reinforcement learning (RL) in hard exploration tasks with sparse rewards. It focuses on stage before agent gets first positive reward, which case, traditional RL algorithms simple strategies often work poorly. Unlike previous methods using some attribute a single state as intrinsic reward to encourage exploration, this leverages social influence between different states permit more efficient exploration. introduces general construction method evaluate dynamically. Three kinds are introduced for state: conformity, power, and authority. By measuring state’s influence, agents quickly find focus during process. The proposed framework evaluation works well task. Extensive experimental analyses comparisons Grid Maze many Atari 2600 games demonstrate its high efficiency.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parallel Shared-Memory State-Space Exploration in Stochastic Modeling

Stochastic modeling forms the basis for analysis in many areas including biological and economic systems as well as the per formance and reliability modeling of computers and communication net works One common approach is the state space based technique which starting from a high level model uses depth rst search to generate both a description of every possible state of the model and the dynami...

متن کامل

State Exploration with Multiple State Groupings

Exploration algorithms are relevant to the industrial practice of generating test cases from an abstract state machine whose runs define the predicted behavior of the software system under test. In this paper we describe a new exploration algorithm that allows multiple state grouping functions to simultaneously guide the search for states that are interesting or relevant for testing. In some ca...

متن کامل

The Bayesian Echo Chamber: Modeling Social Influence via Linguistic Accommodation

We present the Bayesian Echo Chamber, a new Bayesian generative model for social interaction data. By modeling the evolution of people’s language usage over time, this model discovers latent influence relationships between them. Unlike previous work on inferring influence, which has primarily focused on simple temporal dynamics evidenced via turn-taking behavior, our model captures more nuanced...

متن کامل

Reliable Modeling of Ideal Generic Memristors via State-Space Transformation

The paper refers to problems of modeling and computer simulation of generic memristors caused by the so-called window functions, namely the stick effect, nonconvergence, and finding fundamentally incorrect solutions. A profoundly different modeling approach is proposed, which is mathematically equivalent to windowbased modeling. However, due to its numerical stability, it definitely smoothes th...

متن کامل

Steady-state parameter sensitivity in stochastic modeling via trajectory reweighting.

Parameter sensitivity analysis is a powerful tool in the building and analysis of biochemical network models. For stochastic simulations, parameter sensitivity analysis can be computationally expensive, requiring multiple simulations for perturbed values of the parameters. Here, we use trajectory reweighting to derive a method for computing sensitivity coefficients in stochastic simulations wit...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i9.16981